Abstract

We give sufficient conditions for the asymptotic normality of linear combinations of order 
statistics (L-statistics) in the case of simple random samples without replacement. In the first 
case, restrictions are imposed on the weights of L-statistics. The second case concerns trimmed
means, for which we introduce a new finite population smoothness condition.
1 Introduction and results

In the case of independent and identically distributed (i.i.d.) observations, asymptotic normality of
L-statistics under various conditions was shown by Chernoff et al. [4], Shorack [12], Stigler [13, 14]
and Mason [9], among others. See also Serfling [10, Chapter 8]. In the case of samples drawn without
replacement, there are only a few works on the asymptotic normality of L-statistics, e.g., the paper
of Shao [11], where L-statistics under complex sampling designs are considered, and the work of
Chatterjee [3] on the case of the sample quantile.

Let $\mathcal X = \{x_1, \dots, x_N\}$ denote measurements of the study variable $x$ of the population $\mathcal U = \{u_1, \dots, u_N\}$ of subjects or objects, i.e., a real function $x\colon \mathcal U \to \mathbb R$ assigns a fixed value to each element of the population $\mathcal U$. Let $\mathbb X = \{X_1, \dots, X_n\}$ be measurements of units of the simple random sample of size $n < N$ drawn without replacement from the population. The observations $X_1, \dots, X_n$ are identically distributed, but they are not independent. Let $X_{1:n} \le \dots \le X_{n:n}$ denote the order statistics of $\mathbb X$. Define the L-statistic

$$L_n = L_n(\mathbb X) = \frac{1}{n}\sum_{j=1}^{n} c_j X_{j:n}. \tag{1}$$

Here $c_1, \dots, c_n$ is a given sequence of real numbers called weights. Usually these weights are determined by the weight function $J\colon (0, 1) \to \mathbb R$ as follows:

$$c_j = J\Bigl(\frac{j}{n+1}\Bigr), \quad 1 \le j \le n.$$

Further, when we talk about the asymptotics of L-statistics, we use centered statistics (1) with $n^{1/2}$ norming, i.e.,

$$S_n = S_n(\mathbb X) = n^{1/2}(L_n - \mathbf E L_n). \tag{2}$$
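As a quick illustration of (1), the following minimal Python sketch draws a simple random sample without replacement and evaluates an L-statistic. The population values, the helper name `l_statistic`, and the weight rule $c_j = J(j/(n+1))$ used inside it are assumptions made for this example only.

```python
import random


def l_statistic(sample, weight_fn):
    """Return L_n = (1/n) * sum_j c_j * X_{j:n} with c_j = weight_fn(j / (n + 1))."""
    xs = sorted(sample)  # order statistics X_{1:n} <= ... <= X_{n:n}
    n = len(xs)
    return sum(weight_fn(j / (n + 1)) * x for j, x in enumerate(xs, start=1)) / n


# Hypothetical finite population of size N = 40.
population = [float((7 * k) % 23) for k in range(1, 41)]

rng = random.Random(0)
sample = rng.sample(population, 10)  # simple random sample without replacement

# With J(u) = 1 for all u, the L-statistic reduces to the sample mean.
mean_as_l = l_statistic(sample, lambda u: 1.0)
```

Passing a different `weight_fn` changes only the weights; the sampling step stays the same.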
Denote $\sigma_n^2 = \operatorname{Var} S_n$. We are interested in the normal approximation to the distribution function

$$F_n(x) = \mathbf P\{S_n \le x\sigma_n\}.$$

Note that for correct formulations of the following asymptotic results for finite population statistics, we need to consider a sequence of populations $\mathcal X_r = \{x_{r,1}, \dots, x_{r,N_r}\}$, with $N_r \to \infty$ as $r \to \infty$, and a sequence of statistics $L_{n_r}(\mathbb X_r)$, based on simple random samples $\mathbb X_r = \{X_{r,1}, \dots, X_{r,n_r}\}$ drawn without replacement from $\mathcal X_r$. In order to keep the notation simple, we shall skip the subscript $r$ in what follows.

The sample mean is a special case of (1), where $c_j = 1$, $1 \le j \le n$. In this case, for samples drawn without replacement, the classical result on asymptotic normality was established by Erdős and Rényi [7], see also Hájek [8]. Similarly to the case of i.i.d. observations, the key asymptotic condition in [7] is the Lindeberg-type condition: for every $\varepsilon > 0$,

$$\sigma^{-2}\,\mathbf E (X_1 - \mathbf E X_1)^2\, \mathbb I\{|X_1 - \mathbf E X_1| > \varepsilon\tau\sigma\} = o(1) \quad \text{as } N, n \to \infty, \tag{3}$$

where $\sigma^2 = \operatorname{Var} X_1$ and $\tau^2 = Npq$ with $p = n/N$, $q = 1 - p$, and $\mathbb I\{\cdot\}$ is the indicator function. Condition (3) is called the Erdős-Rényi condition. Since L-statistics can be viewed as a certain generalization of the sample mean, one can expect that conditions sufficient for the asymptotic normality should be similar to those used in [7], but with some additional restrictions on the weights $c_1, \dots, c_n$.

On the other hand, L-statistics form a subclass of the more general class of symmetric statistics (symmetric functions of observations). The asymptotic behaviour of symmetric statistics does not differ much from that of the simplest linear statistic (the sample mean is an example), in the sense that, e.g., using Hoeffding's decomposition of Bloznelis and Götze [2], we can write

$$S_n = U_1 + R_1, \quad \text{where} \quad U_1 = \sum_{i=1}^{n} g_1(X_i) \tag{4}$$

is a linear statistic and (we expect that) $R_1$ is a stochastically smaller statistic. Then $S_n$ in (4) is asymptotically standard normal if its linear part $U_1$ is asymptotically standard normal, and $R_1$ becomes degenerate as the sample size

$$n_* := \min\{n, N - n\}$$

increases. In particular, by [2], the components $U_1$ and $R_1$ are centered and uncorrelated, and (by Theorem 1 of [2]) the variance of $R_1$ is bounded as follows: $\mathbf E R_1^2 \le \delta_2$, where it is expected that the particular quantity $\delta_2 = o(1)$ as $n_* \to \infty$. In the present paper, we apply the general result on asymptotic normality of symmetric statistics (see Proposition 3 of [2]) to the case of L-statistics, i.e., we replace the condition imposed on $\delta_2$ by conditions expressed in terms of the weights $c_1, \dots, c_n$ and the population $\mathcal X$.

We assume, without loss of generality, that the values of the population $\mathcal X$ are arranged in non-decreasing order, i.e., $x_1 \le \dots \le x_N$. Let us use the convention $\binom{a}{b} = 0$ for $a < b$. In the case of the L-statistic (2), the function $g_1(\cdot)$ in (4) is represented by, for $1 \le k \le N$,

$$g_1(x_k) = -n^{-1/2}\sum_{j=1}^{n} c_j \sum_{i=1}^{N-1}\Bigl(\mathbb I\{i \ge k\} - \frac{i}{N}\Bigr)\binom{i-1}{j-1}\binom{N-i-1}{n-j}\binom{N-2}{n-1}^{-1}(x_{i+1} - x_i), \tag{5}$$

see Čiginas [5]. Denote $\sigma_1^2 = \mathbf E g_1^2(X_1)$.
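To make the linear part concrete, here is a minimal Python sketch that evaluates $g_1(x_k)$ under the assumption that (5) reads $g_1(x_k) = -n^{-1/2}\sum_{j} c_j \sum_{i=1}^{N-1}(\mathbb I\{i\ge k\} - i/N)\binom{i-1}{j-1}\binom{N-i-1}{n-j}\binom{N-2}{n-1}^{-1}(x_{i+1}-x_i)$. The function name `g1` and the example population are hypothetical. For the sample mean ($c_j = 1$) the inner sum collapses by the Vandermonde identity, and the expression should reduce to $n^{-1/2}(x_k - \bar x)$, which gives a simple sanity check.

```python
from math import comb, sqrt


def g1(k, x, n, c):
    """Evaluate the assumed form of (5) at x_k.

    k is 1-based, x is the population sorted in non-decreasing order,
    n is the sample size and c = [c_1, ..., c_n] are the weights.
    """
    N = len(x)
    denom = comb(N - 2, n - 1)
    total = 0.0
    for i in range(1, N):  # i = 1, ..., N - 1
        inner = sum(
            c[j - 1] * comb(i - 1, j - 1) * comb(N - i - 1, n - j)
            for j in range(1, n + 1)
        )
        indicator = 1.0 if i >= k else 0.0
        # x[i] - x[i - 1] is x_{i+1} - x_i in the paper's 1-based notation.
        total += (indicator - i / N) * inner / denom * (x[i] - x[i - 1])
    return -total / sqrt(n)


# Hypothetical population and the sample-mean weights c_j = 1.
x_demo = sorted([0.3, 1.0, 1.5, 2.2, 4.0, 4.1, 7.5])
n_demo = 3
mean_demo = sum(x_demo) / len(x_demo)
values = [g1(k, x_demo, n_demo, [1.0] * n_demo) for k in range(1, len(x_demo) + 1)]
```

Each entry of `values` can be compared with `(x_demo[k - 1] - mean_demo) / sqrt(n_demo)`.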

First, we consider an L-statistic of the general form (1), and we will require a certain smoothness of its weight function $J(\cdot)$. Reformulate the Erdős-Rényi condition (3): for every $\varepsilon > 0$,

$$\sigma_1^{-2}\,\mathbf E g_1^2(X_1)\, \mathbb I\{|g_1(X_1)| > \varepsilon\tau\sigma_1\} = o(1) \quad \text{as } n_* \to \infty. \tag{6}$$

Then we have the following statement.

Theorem 1. Assume that $n_* \to \infty$ and $\sigma_n \ge c_1 > 0$ for all $n_*$. Suppose that $\mathbf E X_1^2 \le c_2 < \infty$ and that $J(\cdot)$ is bounded and satisfies the Hölder condition of order $\delta > 1/2$ on $(0, 1)$. Let (6) hold. Then $\sigma_n^{-1} S_n$ is asymptotically standard normal.

Note that, in comparison to the case of the sample mean, conditions on the finite population $\mathcal X$ remain very mild. The assumptions of Theorem 1, sufficient for the asymptotic normality of L-statistics, are similar to those obtained by Stigler [14] in the i.i.d. case.

Second, we consider an important special case of (1), i.e., the trimmed means. The trimmed mean is defined as follows: for any fixed numbers $0 \le t_1 < t_2 \le 1$,

$$M_{t_1, t_2} = ([t_2 n] - [t_1 n])^{-1} \sum_{j=[t_1 n]+1}^{[t_2 n]} X_{j:n},$$

where $[\cdot]$ is the greatest integer function. The statistic $M_{t_1, t_2}$ is represented by the weight function $J(u) = (t_2 - t_1)^{-1}\mathbb I\{t_1 < u \le t_2\}$. This function is not sufficiently smooth, i.e., $J(u)$ is bounded, but it does not satisfy the Hölder condition. Let us introduce an additional smoothness condition for the population $\mathcal X$. Assume that, without loss of generality, $x_1 \le \dots \le x_N$. Suppose that, for some constants $C > 0$ and $1/2 < \delta \le 1$, the inequality

$$|x_m - x_l| \le C N^{-\delta}|m - l| \tag{7}$$

is satisfied for all $1 \le l < m \le N$.
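For a given population and exponent, the smallest admissible constant in (7) can be computed directly. The sketch below is an illustration under the reading $|x_m - x_l| \le C N^{-\delta}|m - l|$ for all $l < m$; the function names are hypothetical. It uses the elementary observation that $(x_m - x_l)/(m - l)$ is an average of adjacent gaps, so the maximum over all pairs is attained by a pair of neighbours.

```python
def smoothness_constant(x, delta):
    """Smallest C with |x_m - x_l| <= C * N**(-delta) * |m - l| for all l < m."""
    xs = sorted(x)
    N = len(xs)
    # (x_m - x_l) / (m - l) averages adjacent gaps, so the largest adjacent
    # gap determines the constant.
    max_gap = max(b - a for a, b in zip(xs, xs[1:]))
    return max_gap * N ** delta


def smoothness_constant_bruteforce(x, delta):
    """Same quantity, computed over all pairs l < m (for cross-checking)."""
    xs = sorted(x)
    N = len(xs)
    return max(
        (xs[m] - xs[l]) / (m - l) for l in range(N) for m in range(l + 1, N)
    ) * N ** delta


# Hypothetical population.
x_demo = [0.0, 0.5, 0.6, 2.0, 2.1, 3.5]
c_fast = smoothness_constant(x_demo, 0.75)
c_slow = smoothness_constant_bruteforce(x_demo, 0.75)
```

In an asymptotic setting one would track how this constant behaves along the sequence of populations, since (7) asks for a single $C$ that works for all of them.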

Theorem 2. Assume that $n_* \to \infty$ and $\sigma_n \ge c_1 > 0$ for all $n_*$. Suppose that $\mathbf E X_1^2 \le c_2 < \infty$. Assume that (7) is satisfied for some $1/2 < \delta \le 1$, and $(1 - n/N)^{-1} n^{1/2} N^{\delta - 1} \to \infty$. Then, in the case of a trimmed mean, $\sigma_n^{-1} S_n$ is asymptotically standard normal.
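The trimmed mean to which Theorem 2 applies can be computed as in the following minimal Python sketch. It assumes the definition $M_{t_1,t_2} = ([t_2 n] - [t_1 n])^{-1}\sum_{j=[t_1 n]+1}^{[t_2 n]} X_{j:n}$ with $[\cdot]$ the floor function; the function name is hypothetical, and floating-point products such as `t1 * n` may need exact arithmetic (e.g. `fractions.Fraction`) when they land on an integer boundary.

```python
from math import floor


def trimmed_mean(sample, t1, t2):
    """Average of the order statistics X_{j:n} for [t1*n] + 1 <= j <= [t2*n]."""
    xs = sorted(sample)
    n = len(xs)
    lo, hi = floor(t1 * n), floor(t2 * n)
    if hi <= lo:
        raise ValueError("empty trimming range: need [t2*n] > [t1*n]")
    # The 0-based slice xs[lo:hi] holds the 1-based ranks lo + 1, ..., hi.
    return sum(xs[lo:hi]) / (hi - lo)


# Hypothetical sample: trimming 10% on each side of 1, ..., 10 keeps 2, ..., 9.
demo = trimmed_mean(list(range(1, 11)), 0.1, 0.9)
full = trimmed_mean(list(range(1, 11)), 0.0, 1.0)
```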

In the case of i.i.d. observations, it was shown by Stigler [13] that in order for the trimmed mean to be asymptotically normal, it is necessary and sufficient that the sample is trimmed at sample quantiles for which the corresponding population quantiles are uniquely defined. Thus, the conditions of Theorem 2 seem too strong. On the other hand, in finite population settings, the new smoothness condition (7) has a specific interpretation. Let us take $l = 1$ and $m = N$. If the population $\mathcal X$ is bounded, then the condition is satisfied for $\delta = 1$. For any finite population, condition (7) is satisfied in the marginal case of $\delta = 1/2$. The latter fact follows from the Nair-Thomson inequality $|x_N - x_1| \le \sigma\sqrt{2N}$ (see, e.g., Balakrishnan et al. [1]). Thus, condition (7) seems very mild for small $\theta > 0$ in $\delta = 1/2 + \theta$, i.e., it holds for most possible populations. Obviously, if we are interested in the asymptotic normality of the trimmed means, then, by the conditions of Theorem 2, for small $\theta$ we should have $n \to \infty$ quite quickly as $N \to \infty$, while in the case of $\delta = 1$ it suffices that $n \to \infty$ arbitrarily slowly with respect to the growth of the population size $N$.
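The range bound quoted above is easy to check numerically. The sketch below assumes that $\sigma^2$ is the population variance with divisor $N$ (so that it coincides with $\operatorname{Var} X_1$ for a single draw); the function name and test populations are hypothetical. The inequality then follows from $(x_N - x_1)^2/2 \le (x_N - \bar x)^2 + (x_1 - \bar x)^2 \le N\sigma^2$.

```python
from math import sqrt


def range_bound_holds(x, tol=1e-12):
    """Check the range inequality max(x) - min(x) <= sigma * sqrt(2 * N)."""
    N = len(x)
    mean = sum(x) / N
    sigma = sqrt(sum((v - mean) ** 2 for v in x) / N)  # divisor N, not N - 1
    return max(x) - min(x) <= sigma * sqrt(2 * N) + tol


# Equality case: all mass between the two extremes sits at the mean.
extreme = [-1.0, 0.0, 0.0, 0.0, 1.0]
generic = [3.0, 1.0, 4.0, 1.0, 5.0, 9.0, 2.0, 6.0]
checks = (range_bound_holds(extreme), range_bound_holds(generic))
```

The small tolerance only guards against rounding in the equality case.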
2 Proofs

In the proofs of Theorems 1 and 2 we assume that, without loss of generality, $x_1 \le \dots \le x_N$.

Proof of Theorem 1. First, we show that $\sigma_n$ is bounded as $n_* \to \infty$. Then the condition: for every $\varepsilon > 0$,

$$n_*\,\mathbf E g_1^2(X_1)\, \mathbb I\{g_1^2(X_1) > \varepsilon\} = o(1) \quad \text{as } n_* \to \infty \tag{8}$$

of Proposition 3 in Bloznelis and Götze [2] is equivalent (see ibidem) to condition (6). It is shown by Čiginas and Pumputis [6] that, for any symmetric statistic, the inequality

$$\sigma_n^2 \le \frac{n}{2}\Bigl(1 - \frac{n}{N}\Bigr)\mathbf E (D_1 S_n)^2 \tag{9}$$



holds. Here $D_1 S_n = S_n(\mathbb X_1 \setminus \{X_{n+1}\}) - S_n(\mathbb X_1 \setminus \{X_1\})$, where $\mathbb X_1 = \{X_1, \dots, X_{n+1}\}$ is the extended sample. Introduce the events $\mathcal R_{1;ij} = \{R_{1:2} = i, R_{n+1:2} = j\}$, $1 \le i < j \le n+1$, where $R_{1:2} < R_{n+1:2}$ denote the order statistics of the ranks $\{R_1, R_{n+1}\}$ of $\{X_1, X_{n+1}\}$ in the set $\mathbb X_1$. Here all ranks $\{R_1, \dots, R_{n+1}\}$ of $\mathbb X_1$ are distinct if, in the case of ties in $\mathcal X$, we order (select ranks for) tied observations randomly with equal probabilities. The probabilities of the events are

$$p_{1;ij} := \mathbf P\{\mathcal R_{1;ij}\} = \binom{n+1}{2}^{-1}.$$



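Since $J(\cdot)$ is bounded, there exists an absolute constant $a$ such that

$$\max_{1 \le p \le n} |c_p| \le a \tag{10}$$

for all $n$. By Lemma 2 of [5] and (10), we obtain

$$\begin{aligned}
\mathbf E (D_1 S_n)^2 &= \sum_{1 \le i < j \le n+1} p_{1;ij}\, \mathbf E\bigl[(D_1 S_n)^2 \bigm| \mathcal R_{1;ij}\bigr] \\
&\le n^{-1} \sum_{1 \le i < j \le n+1} p_{1;ij}\, \mathbf E\Bigl[\Bigl(\sum_{p=R_{1:2}}^{R_{n+1:2}-1} c_p \Delta_{p:n+1}\Bigr)^2 \Bigm| \mathcal R_{1;ij}\Bigr] \\
&\le a^2 n^{-1} \sum_{1 \le i < j \le n+1} p_{1;ij}\, \mathbf E\bigl[(X_{j:n+1} - X_{i:n+1})^2 \bigm| \mathcal R_{1;ij}\bigr],
\end{aligned} \tag{11}$$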



where $\Delta_{p:n+1} = X_{p+1:n+1} - X_{p:n+1}$, $1 \le p \le n$ denote the spacings of the sample $\mathbb X_1$. Since the events $\mathcal R_{1;ij}$ and $\mathcal B_{1;ijlm} = \{X_{i:n+1} = x_l, X_{j:n+1} = x_m\}$, $1 \le l < m \le N$ are independent, for $x_1 < \dots < x_N$ we get

$$p_{1;ijlm} := \mathbf P\{\mathcal B_{1;ijlm} \mid \mathcal R_{1;ij}\} = \binom{N}{n+1}^{-1}\binom{l-1}{i-1}\binom{m-l-1}{j-i-1}\binom{N-m}{n+1-j}.$$

For $x_1 \le \dots \le x_N$ these probabilities are the same. It follows from an argument similar to Lemma 2.1 of Balakrishnan et al. [1]. We also have that, by the generalized Vandermonde identity,

$$\sum_{1 \le i < j \le n+1} p_{1;ijlm} = \binom{N}{n+1}^{-1}\sum_{s=0}^{n-1}\sum_{t=0}^{n-1-s}\binom{l-1}{s}\binom{m-l-1}{t}\binom{N-m}{n-1-s-t} = \binom{N}{n+1}^{-1}\binom{N-2}{n-1}.$$
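The Vandermonde-type summation used at this step can be confirmed with exact rational arithmetic. The sketch below assumes the hypergeometric form $p_{1;ijlm} = \binom{N}{n+1}^{-1}\binom{l-1}{i-1}\binom{m-l-1}{j-i-1}\binom{N-m}{n+1-j}$ and checks that summing over $1 \le i < j \le n+1$ gives $\binom{N}{n+1}^{-1}\binom{N-2}{n-1}$ for every pair $l < m$. The function names and the sizes $N = 8$, $n = 3$ are chosen only for illustration.

```python
from fractions import Fraction
from math import comb


def p1(i, j, l, m, N, n):
    """Assumed joint probability that X_{i:n+1} = x_l and X_{j:n+1} = x_m."""
    ways = comb(l - 1, i - 1) * comb(m - l - 1, j - i - 1) * comb(N - m, n + 1 - j)
    return Fraction(ways, comb(N, n + 1))


def summed_over_ranks(l, m, N, n):
    """Sum of p1 over all rank pairs 1 <= i < j <= n + 1."""
    return sum(
        p1(i, j, l, m, N, n)
        for i in range(1, n + 2)
        for j in range(i + 1, n + 2)
    )


N_demo, n_demo = 8, 3
target = Fraction(comb(N_demo - 2, n_demo - 1), comb(N_demo, n_demo + 1))
all_equal = all(
    summed_over_ranks(l, m, N_demo, n_demo) == target
    for l in range(1, N_demo)
    for m in range(l + 1, N_demo + 1)
)
```

Using `Fraction` avoids any rounding, so the comparison is exact.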
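Then note that

$$\mathbf E\bigl[(X_{j:n+1} - X_{i:n+1})^2 \bigm| \mathcal R_{1;ij}\bigr] = \sum_{1 \le l < m \le N} p_{1;ijlm}(x_m - x_l)^2$$

and continue (11):

$$\begin{aligned}
\mathbf E (D_1 S_n)^2 &\le a^2 n^{-1} \sum_{1 \le i < j \le n+1} p_{1;ij} \sum_{1 \le l < m \le N} p_{1;ijlm}(x_m - x_l)^2 \\
&= a^2 n^{-1}\binom{n+1}{2}^{-1}\binom{N}{n+1}^{-1}\binom{N-2}{n-1} \sum_{1 \le l < m \le N}(x_m - x_l)^2 \\
&= a^2 n^{-1}\binom{N}{2}^{-1} \sum_{1 \le l < m \le N}(x_m - x_l)^2 = 2a^2 n^{-1}\frac{N}{N-1}\operatorname{Var} X_1.
\end{aligned}$$

Finally, from (9) we get

$$\sigma_n^2 \le a^2\,\frac{N-n}{N-1}\operatorname{Var} X_1 = O(1) \quad \text{as } n_* \to \infty.$$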
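The spacing representation that drives the middle line of (11) can be checked on a small example. Removing the element of rank $r_1$ or of rank $r_2 > r_1$ from a sorted extended sample $Y_1 \le \dots \le Y_{n+1}$ changes the weighted sum $\sum_p c_p Y_{(p)}$ by exactly $\sum_{p=r_1}^{r_2-1} c_p (Y_{p+1} - Y_p)$. The sketch below verifies this identity with integer data; all names and numbers are hypothetical, and the normalising factor $n^{-1/2}$ of $S_n$ is left out.

```python
def weighted_sum(values, c):
    """sum_p c_p * (p-th smallest value); len(values) must equal len(c)."""
    return sum(cp * v for cp, v in zip(c, sorted(values)))


def removal_difference(ext, r1, r2, c):
    """Weighted sum without rank r1 minus weighted sum without rank r2 (1-based)."""
    without_r1 = ext[: r1 - 1] + ext[r1:]
    without_r2 = ext[: r2 - 1] + ext[r2:]
    return weighted_sum(without_r1, c) - weighted_sum(without_r2, c)


def spacing_sum(ext, r1, r2, c):
    """sum_{p = r1}^{r2 - 1} c_p * (Y_{p+1} - Y_p) for the sorted list ext."""
    return sum(c[p - 1] * (ext[p] - ext[p - 1]) for p in range(r1, r2))


ext_demo = [1, 3, 4, 8, 9, 15]  # sorted extended sample, n + 1 = 6
c_demo = [2, -1, 0, 3, 5]       # weights c_1, ..., c_n with n = 5
identity_holds = all(
    removal_difference(ext_demo, r1, r2, c_demo) == spacing_sum(ext_demo, r1, r2, c_demo)
    for r1 in range(1, 7)
    for r2 in range(r1 + 1, 7)
)
```

Bounding each $|c_p|$ by $a$ and summing the non-negative spacings then gives the last line of (11).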



Second, we show that, under the conditions of the theorem, the condition $\delta_2(S_n) = o(1)$ as $n_* \to \infty$ of Proposition 3 in [2] is satisfied. Here $\delta_2(S_n) = \mathbf E (n_* D_2 S_n)^2$, where

$$D_2 S_n = S_n(\mathbb X_2 \setminus \{X_{n+1}, X_{n+2}\}) - S_n(\mathbb X_2 \setminus \{X_1, X_{n+2}\}) - S_n(\mathbb X_2 \setminus \{X_2, X_{n+1}\}) + S_n(\mathbb X_2 \setminus \{X_1, X_2\})$$

with the extended sample $\mathbb X_2 = \{X_1, \dots, X_{n+2}\}$, see [2]. Similarly, introduce the events $\mathcal R_{2;ij} = \{R_{2:4} = i, R_{n+1:4} = j\}$, $1 \le i < j \le n+2$, where $R_{1:4} < R_{2:4} < R_{n+1:4} < R_{n+2:4}$ denote the order statistics of the ranks $\{R_1, R_2, R_{n+1}, R_{n+2}\}$ of $\{X_1, X_2, X_{n+1}, X_{n+2}\}$ in the set $\mathbb X_2$. Now

$$p_{2;ij} := \mathbf P\{\mathcal R_{2;ij}\} = \binom{i-1}{1}\binom{n+2-j}{1}\binom{n+2}{4}^{-1}. \tag{12}$$

We also similarly have

$$p_{2;ijlm} := \mathbf P\{\mathcal B_{2;ijlm} \mid \mathcal R_{2;ij}\} = \binom{N}{n+2}^{-1}\binom{l-1}{i-1}\binom{m-l-1}{j-i-1}\binom{N-m}{n+2-j},$$

where the events $\mathcal R_{2;ij}$ and $\mathcal B_{2;ijlm} = \{X_{i:n+2} = x_l, X_{j:n+2} = x_m\}$, $1 \le l < m \le N$ are independent.
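Since $J(\cdot)$ satisfies the Hölder condition of order $\delta > 1/2$ on $(0, 1)$, we find that, for some constant $B > 0$,

$$\Bigl|J\Bigl(\frac{p}{n+1}\Bigr) - J\Bigl(\frac{p-1}{n+1}\Bigr)\Bigr| \le B(n+1)^{-\delta}$$

or

$$\max_{2 \le p \le n} |c_p - c_{p-1}| \le B(n+1)^{-\delta}, \quad \text{for some } \delta > 1/2. \tag{13}$$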



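By Lemma 2 of [5] and (13), and since $n_* \le n$, we obtain

$$\begin{aligned}
\delta_2(S_n) &= n_*^2 \sum_{1 \le i < j \le n+2} p_{2;ij}\, \mathbf E\bigl[(D_2 S_n)^2 \bigm| \mathcal R_{2;ij}\bigr] \\
&\le n_*^2 n^{-1} \sum_{1 \le i < j \le n+2} p_{2;ij}\, \mathbf E\Bigl[\Bigl(\sum_{p=R_{2:4}}^{R_{n+1:4}-1} (c_p - c_{p-1})\Delta_{p:n+2}\Bigr)^2 \Bigm| \mathcal R_{2;ij}\Bigr] \\
&\le B^2 n_*^2 n^{-1}(n+1)^{-2\delta} \sum_{1 \le i < j \le n+2} p_{2;ij}\, \mathbf E\bigl[(X_{j:n+2} - X_{i:n+2})^2 \bigm| \mathcal R_{2;ij}\bigr] \\
&\le B^2 n^{1-2\delta} \sum_{1 \le l < m \le N} \lambda_{2;lm}(x_m - x_l)^2,
\end{aligned} \tag{14}$$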






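where $\lambda_{2;lm} = \sum_{1 \le i < j \le n+2} p_{2;ij}\, p_{2;ijlm}$. Taking $j = i + 1$ and applying $\max_{0 \le u \le 1} u(1-u) \le 1/4$, for all $1 \le i < j \le n+2$, we get the inequalities

$$p_{2;ij} \le p_{2;i,i+1} = n^2\,\frac{i-1}{n}\Bigl(1 - \frac{i-1}{n}\Bigr)\binom{n+2}{4}^{-1} \le \frac{n^2}{4}\binom{n+2}{4}^{-1}.$$

Then, noting that, by the generalized Vandermonde identity,

$$\sum_{1 \le i < j \le n+2} p_{2;ijlm} = \binom{N}{n+2}^{-1}\binom{N-2}{n},$$

we obtain, for all $1 \le l < m \le N$,

$$\lambda_{2;lm} \le \frac{n^2}{4}\binom{n+2}{4}^{-1}\binom{N}{n+2}^{-1}\binom{N-2}{n} \le 24 N^{-2}.$$

Finally, it follows from this bound and (14) that

$$\delta_2(S_n) \le 24 B^2 n^{1-2\delta}\operatorname{Var} X_1 = o(1) \quad \text{as } n_* \to \infty.$$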
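The bound $\lambda_{2;lm} \le 24N^{-2}$ can be checked exactly for small sizes. The sketch below assumes $p_{2;ij} = (i-1)(n+2-j)\binom{n+2}{4}^{-1}$ and $p_{2;ijlm} = \binom{N}{n+2}^{-1}\binom{l-1}{i-1}\binom{m-l-1}{j-i-1}\binom{N-m}{n+2-j}$, and computes $\lambda_{2;lm} = \sum_{i<j} p_{2;ij}\,p_{2;ijlm}$ by direct summation. Function names and the sizes $N = 9$, $n = 4$ are illustrative choices only.

```python
from fractions import Fraction
from math import comb


def p2_rank(i, j, n):
    """Assumed probability that the two middle ranks of the four are i < j."""
    return Fraction((i - 1) * (n + 2 - j), comb(n + 2, 4))


def p2_values(i, j, l, m, N, n):
    """Assumed probability that X_{i:n+2} = x_l and X_{j:n+2} = x_m."""
    ways = comb(l - 1, i - 1) * comb(m - l - 1, j - i - 1) * comb(N - m, n + 2 - j)
    return Fraction(ways, comb(N, n + 2))


def lam(l, m, N, n):
    """lambda_{2;lm}: sum over 1 <= i < j <= n + 2 of p2_rank * p2_values."""
    return sum(
        p2_rank(i, j, n) * p2_values(i, j, l, m, N, n)
        for i in range(1, n + 3)
        for j in range(i + 1, n + 3)
    )


N_demo, n_demo = 9, 4
rank_total = sum(
    p2_rank(i, j, n_demo) for i in range(1, n_demo + 3) for j in range(i + 1, n_demo + 3)
)
bound_ok = all(
    lam(l, m, N_demo, n_demo) <= Fraction(24, N_demo ** 2)
    for l in range(1, N_demo)
    for m in range(l + 1, N_demo + 1)
)
```

Here `rank_total` should equal 1, which confirms that the assumed rank probabilities form a distribution.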

All the conditions of Proposition 3 in [2] are verified. Thus, the theorem is proven. □ 

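Proof of Theorem 2. First, we show that condition (7) with $(1 - n/N)^{-1} n^{1/2} N^{\delta-1} \to \infty$ implies (8). Noting that

$$\sum_{j=1}^{n}\binom{i-1}{j-1}\binom{N-i-1}{n-j}\binom{N-2}{n-1}^{-1} = 1,$$

applying (10) and (7), we get from (5),

$$\max_{1 \le k \le N} |g_1(x_k)| \le aC n^{-1/2} N^{-\delta} \max_{1 \le k \le N} \sum_{i=1}^{N-1}\Bigl|\mathbb I\{i \ge k\} - \frac{i}{N}\Bigr| \le \frac{aC}{2}\, n^{-1/2} N^{1-\delta}.$$

Therefore, for a fixed $\varepsilon > 0$,

$$\mathbb I\{|g_1(X_1)| > \varepsilon\} \le \mathbb I\Bigl\{\max_{1 \le k \le N}|g_1(x_k)| > \varepsilon\Bigr\} \le \mathbb I\Bigl\{\frac{aC}{2}\, n^{-1/2} N^{1-\delta} > \varepsilon\Bigr\}.$$

We obtain from here and from (3.9) of [5] that

$$n_*\,\mathbf E g_1^2(X_1)\, \mathbb I\{|g_1(X_1)| > \varepsilon\} \le \mathbb I\Bigl\{\frac{aC}{2}\, n^{-1/2} N^{1-\delta} > \varepsilon\Bigr\}\, n_*\,\mathbf E g_1^2(X_1) \le 4a^2\, \mathbb I\Bigl\{n^{-1/2} N^{1-\delta} > \frac{2\varepsilon}{aC}\Bigr\}\, \mathbf E X_1^2.$$

Condition (8) is proven.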
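The elementary sum that appears in the bound for $\max_k |g_1(x_k)|$ has a closed form: $\sum_{i=1}^{N-1} |\mathbb I\{i \ge k\} - i/N| = \{k(k-1) + (N-k)(N-k+1)\}/(2N)$, which is largest at $k = 1$ and $k = N$, where it equals $(N-1)/2$. The following sketch, with a hypothetical function name and $N = 11$ chosen for illustration, confirms this with exact arithmetic.

```python
from fractions import Fraction


def abs_weight_sum(k, N):
    """sum_{i=1}^{N-1} | I{i >= k} - i/N | for 1 <= k <= N."""
    return sum(abs((1 if i >= k else 0) - Fraction(i, N)) for i in range(1, N))


N_demo = 11
sums = [abs_weight_sum(k, N_demo) for k in range(1, N_demo + 1)]
closed_form = [
    Fraction(k * (k - 1) + (N_demo - k) * (N_demo - k + 1), 2 * N_demo)
    for k in range(1, N_demo + 1)
]
```

The maximum $(N-1)/2 \le N/2$ is what turns the factor $N^{-\delta}$ into $N^{1-\delta}/2$.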

Second, as in the proof of Theorem 1, we verify the condition $\delta_2(S_n) = o(1)$ as $n_* \to \infty$. Write, for short, $s = [t_1 n] + 1$ and $t = [t_2 n]$. Similarly, applying Lemma 2 of [5], we obtain

$$\delta_2(S_n) \le \frac{n_*^2\, n}{(t - s + 1)^2} \sum_{1 \le i < j \le n+2} p_{2;ij}\, \mathbf E A_{ij}^2(s, t), \tag{15}$$

where $p_{2;ij}$ is given by (12) and

$$A_{ij}(s, t) = \sum_{p=i}^{j-1} (c_p - c_{p-1})\Delta_{p:n+2} \quad \text{with} \quad c_p = \mathbb I\{s \le p \le t\}.$$



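We can assume, without loss of generality, that $n > (t_2 - t_1)^{-1}$. Then we have $s \le t$. It also follows from the inequality $[t_2 n] - [t_1 n] > t_2 n - 1 - t_1 n$ and from the same assumption that, for some constant $C_1 > 0$,

$$\frac{n^2}{(t - s + 1)^2} \le \Bigl(t_2 - t_1 - \frac{1}{n}\Bigr)^{-2} \le C_1. \tag{16}$$

Let us decompose $\mathcal I = \{(i, j)\colon 2 \le i < j \le n+1\}$, for fixed $s \le t$, into mutually disjoint subsets

$$\begin{aligned}
\mathcal I_1 &= \{(i, j)\colon t+2 \le i < j \le n+1\}, \\
\mathcal I_2 &= \{(i, j)\colon 2 \le i < j \le s\}, \\
\mathcal I_3 &= \{(i, j)\colon s+1 \le i < j \le t+1\}, \\
\mathcal I_4 &= \{(i, j)\colon s+1 \le i \le t+1,\ t+2 \le j \le n+1\}, \\
\mathcal I_5 &= \{(i, j)\colon 2 \le i \le s,\ s+1 \le j \le t+1\}, \\
\mathcal I_6 &= \{(i, j)\colon 2 \le i \le s,\ t+2 \le j \le n+1\},
\end{aligned}$$

such that $\mathcal I = \mathcal I_1 \cup \dots \cup \mathcal I_6$. Then we get

$$A_{ij}(s, t) = \begin{cases}
0 & \text{if } (i, j) \in \mathcal I_1 \cup \mathcal I_2 \cup \mathcal I_3, \\
-c_t \Delta_{t+1:n+2} & \text{if } (i, j) \in \mathcal I_4, \\
c_s \Delta_{s:n+2} & \text{if } (i, j) \in \mathcal I_5, \\
c_s \Delta_{s:n+2} - c_t \Delta_{t+1:n+2} & \text{if } (i, j) \in \mathcal I_6.
\end{cases}$$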
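The case analysis for $A_{ij}(s,t)$ can be verified mechanically. With $c_p = \mathbb I\{s \le p \le t\}$, the difference $c_p - c_{p-1}$ equals $+1$ at $p = s$, $-1$ at $p = t+1$ and $0$ elsewhere, so $A_{ij}(s,t) = \sum_{p=i}^{j-1}(c_p - c_{p-1})\Delta_{p:n+2}$ only picks up $\Delta_{s:n+2}$ and $\Delta_{t+1:n+2}$. The sketch below compares the direct sum with the six-set description for every admissible $(i, j, s, t)$; the spacing values and all function names are hypothetical.

```python
def a_direct(i, j, s, t, gaps):
    """A_ij(s, t) computed from its definition; gaps[p] stands for Delta_{p:n+2}."""
    def c(p):
        return 1 if s <= p <= t else 0
    return sum((c(p) - c(p - 1)) * gaps[p] for p in range(i, j))


def index_set(i, j, s, t):
    """Return r such that (i, j) lies in the subset I_r of the decomposition."""
    if i >= t + 2:
        return 1
    if j <= s:
        return 2
    if i >= s + 1 and j <= t + 1:
        return 3
    if i >= s + 1:  # here s + 1 <= i <= t + 1 and j >= t + 2
        return 4
    if j <= t + 1:  # here i <= s and s + 1 <= j <= t + 1
        return 5
    return 6        # here i <= s and j >= t + 2


def a_cases(i, j, s, t, gaps):
    """A_ij(s, t) read off from the piecewise formula."""
    r = index_set(i, j, s, t)
    if r in (1, 2, 3):
        return 0
    if r == 4:
        return -gaps[t + 1]
    if r == 5:
        return gaps[s]
    return gaps[s] - gaps[t + 1]


n_demo = 7
gaps_demo = {p: p * p + 1 for p in range(1, n_demo + 2)}  # arbitrary spacings
cases_match = all(
    a_direct(i, j, s, t, gaps_demo) == a_cases(i, j, s, t, gaps_demo)
    for s in range(1, n_demo + 1)
    for t in range(s, n_demo + 1)
    for i in range(2, n_demo + 1)
    for j in range(i + 1, n_demo + 2)
)
```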

Now, by collecting the terms of the sum $\sum_{i<j}$ with the same value of $\mathbf E A_{ij}^2(s, t)$ in (15), applying

$$\mathbf E(\Delta_{t+1:n+2} - \Delta_{s:n+2})^2 \le \mathbf E \Delta_{t+1:n+2}^2 + \mathbf E \Delta_{s:n+2}^2,$$

then collecting terms with $\mathbf E \Delta_{t+1:n+2}^2$ and $\mathbf E \Delta_{s:n+2}^2$, and also invoking inequality (16), we obtain



<M<SW) < Cin\n 



i(n + 2 



+ 



t + 1\ (n - t + 1 



E A 



{n-t + lf 



t+l:n+2 

n — s + 1 
2 



E A 



s:n+2 



(17) 



By applying the simple inequality $\binom{u}{v} \le u^v / v!$, we derive

$$\binom{t+1}{2}\binom{n-t+1}{2} \le \frac{(n+2)^4}{4}\Bigl(\frac{t+1}{n+2}\Bigr)^2\Bigl(1 - \frac{t+1}{n+2}\Bigr)^2 \le \frac{(n+2)^4}{64}. \tag{18}$$



Taking s = t, very similarly we get 



n + 1 



n+1 



< 



(n+1) 4 
32 



(19) 
Next, it is easy to calculate (invoking Lemma 2.1 of Balakrishnan et al. [1]) that, for $1 \le p \le n+1$,

$$\mathbf E \Delta_{p:n+2}^2 = \binom{N}{n+2}^{-1} \sum_{1 \le l < m \le N}\binom{l-1}{p-1}\binom{N-m}{n+1-p}(x_m - x_l)^2.$$

Then, using (7), we obtain

$$\mathbf E \Delta_{p:n+2}^2 \le \frac{C^2}{N^{2\delta}}\binom{N}{n+2}^{-1} \sum_{1 \le l < m \le N}(m - l)^2\binom{l-1}{p-1}\binom{N-m}{n+1-p} = \frac{C^2 (N+1)(2N-n)}{N^{2\delta}(n+3)(n+4)}. \tag{20}$$
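The closed form on the right of (20) can be confirmed by exact summation. The sketch below assumes that the combinatorial sum is $\binom{N}{n+2}^{-1}\sum_{l<m}(m-l)^2\binom{l-1}{p-1}\binom{N-m}{n+1-p}$, i.e. the mean squared index gap between the $p$-th and $(p+1)$-th order statistics of a sample of size $n+2$, and checks that it equals $(N+1)(2N-n)/\{(n+3)(n+4)\}$ for every $p$. The function name and the sizes $N = 10$, $n = 3$ are illustrative.

```python
from fractions import Fraction
from math import comb


def mean_squared_index_gap(p, N, n):
    """E (m - l)^2 when X_{p:n+2} = x_l and X_{p+1:n+2} = x_m (distinct values)."""
    total = sum(
        (m - l) ** 2 * comb(l - 1, p - 1) * comb(N - m, n + 1 - p)
        for l in range(1, N)
        for m in range(l + 1, N + 1)
    )
    return Fraction(total, comb(N, n + 2))


N_demo, n_demo = 10, 3
closed_form = Fraction((N_demo + 1) * (2 * N_demo - n_demo), (n_demo + 3) * (n_demo + 4))
gaps_sq = [mean_squared_index_gap(p, N_demo, n_demo) for p in range(1, n_demo + 2)]
```

The value does not depend on $p$, which reflects the exchangeability of the index spacings.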



Here the last equality is obtained by applying simple binomial identities

$$\binom{m}{k} = \frac{m}{k}\binom{m-1}{k-1}, \qquad \sum_{j=k}^{m}\binom{j}{k} = \binom{m+1}{k+1}, \qquad \sum_{j=0}^{m}\binom{j}{p}\binom{m-j}{k} = \binom{m+1}{p+k+1}.$$

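Indeed, for instance,

$$\sum_{1 \le l < m \le N}\binom{l-1}{p-1}\binom{N-m}{n+1-p} = \sum_{m=2}^{N}\binom{N-m}{n+1-p}\sum_{l=1}^{m-1}\binom{l-1}{p-1} = \sum_{m=2}^{N}\binom{m-1}{p}\binom{N-m}{n+1-p} = \binom{N}{n+2},$$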

and so on. Finally, applying (18), (19) and (20), and $n_* \le 2n(1 - n/N)$, we continue (17):

$$\delta_2(S_n) \le C_1 n_*^2 n^{-1}\binom{n+2}{4}^{-1}\frac{C^2 (N+1)(2N-n)}{N^{2\delta}(n+3)(n+4)}\Bigl[\frac{(n+2)^4}{64} + \frac{(n+1)^4}{32}\Bigr] \le C_2\Bigl(1 - \frac{n}{N}\Bigr)^2\frac{N^{2(1-\delta)}}{n} = o(1) \quad \text{as } n_* \to \infty,$$

for some constant $C_2 > 0$. The theorem is proven. □